366 research outputs found

    Multiscale Feature Attribution for Outliers

    Full text link
    Machine learning techniques can automatically identify outliers in massive datasets, much faster and more reproducible than human inspection ever could. But finding such outliers immediately leads to the question: which features render this input anomalous? We propose a new feature attribution method, Inverse Multiscale Occlusion, that is specifically designed for outliers, for which we have little knowledge of the type of features we want to identify and expect that the model performance is questionable because anomalous test data likely exceed the limits of the training data. We demonstrate our method on outliers detected in galaxy spectra from the Dark Energy Survey Instrument and find its results to be much more interpretable than alternative attribution approaches.Comment: 6 pages, 2 figures, accepted to NeurIPS 2023 Workshop on Machine Learning and the Physical Sciences. Code available at https://github.com/al-jshen/im

    Plausible Adversarial Attacks on Direct Parameter Inference Models in Astrophysics

    Full text link
    In this abstract we explore the possibility of introducing biases in physical parameter inference models from adversarial-type attacks. In particular, we inject small amplitude systematics into inputs to a mixture density networks tasked with inferring cosmological parameters from observed data. The systematics are constructed analogously to white-box adversarial attacks. We find that the analysis network can be tricked into spurious detection of new physics in cases where standard cosmological estimators would be insensitive. This calls into question the robustness of such networks and their utility for reliably detecting new physics.Comment: Accepted submission to Machine Learning and the Physical Sciences workshop, NeurIPS 202

    Spotting Hallucinations in Inverse Problems with Data-Driven Priors

    Full text link
    Hallucinations are an inescapable consequence of solving inverse problems with deep neural networks. The expressiveness of recent generative models is the reason why they can yield results far superior to conventional regularizers; it can also lead to realistic-looking but incorrect features, potentially undermining the trust in important aspects of the reconstruction. We present a practical and computationally efficient method to determine, which regions in the solutions of inverse problems with data-driven priors are prone to hallucinations. By computing the diagonal elements of the Fisher information matrix of the likelihood and the data-driven prior separately, we can flag regions where the information is prior-dominated. Our diagnostic can directly be compared to the reconstructed solutions and enables users to decide if measurements in such regions are robust for their application. Our method scales linearly with the number of parameters and is thus applicable in high-dimensional settings, allowing it to be rolled out broadly for the large-volume data products of future wide-field surveys.Comment: 7 pages 3 figures, Accepted at the ICML 2023 Workshop on Machine Learning for Astrophysic

    First measurement of gravitational lensing by cosmic voids in SDSS

    Full text link
    We report the first measurement of the diminutive lensing signal arising from matter underdensities associated with cosmic voids. While undetectable individually, by stacking the weak gravitational shear estimates around 901 voids detected in SDSS DR7 by Sutter et al. (2012a), we find substantial evidence for a depression of the lensing signal compared to the cosmic mean. This depression is most pronounced at the void radius, in agreement with analytical models of void matter profiles. Even with the largest void sample and imaging survey available today, we cannot put useful constraints on the radial dark-matter void profile. We invite independent investigations of our findings by releasing data and analysis code to the public at https://github.com/pmelchior/void-lensingComment: 6 pages, 5 figures, as accepted by MNRA

    Lightweight starshade position sensing with convolutional neural networks and simulation-based inference

    Full text link
    Starshades are a leading technology to enable the direct detection and spectroscopic characterization of Earth-like exoplanets. To keep the starshade and telescope aligned over large separations, reliable sensing of the peak of the diffracted light of the occluded star is required. Current techniques rely on image matching or model fitting, both of which put substantial computational burdens on resource-limited spacecraft computers. We present a lightweight image processing method based on a convolutional neural network paired with a simulation-based inference technique to estimate the position of the spot of Arago and its uncertainty. The method achieves an accuracy of a few centimeters across the entire pupil plane, while only requiring 1.6 MB in stored data structures and 5.3 MFLOPs (million floating point operations) per image at test time. By deploying our method at the Princeton Starshade Testbed, we demonstrate that the neural network can be trained on simulated images and used on real images, and that it can successfully be integrated in the control system for closed-loop formation flying.Comment: submitted to JATI

    Parametrising arbitrary galaxy morphologies: potentials and pitfalls

    Full text link
    We demonstrate that morphological observables (e.g. steepness of the radial light profile, ellipticity, asymmetry) are intertwined and cannot be measured independently of each other. We present strong arguments in favour of model-based parametrisation schemes, namely reliability assessment, disentanglement of morphological observables, and PSF modelling. Furthermore, we demonstrate that estimates of the concentration and Sersic index obtained from the Zurich Structure & Morphology catalogue are in excellent agreement with theoretical predictions. We also demonstrate that the incautious use of the concentration index for classification purposes can cause a severe loss of the discriminative information contained in a given data sample. Moreover, we show that, for poorly resolved galaxies, concentration index and M_20 suffer from strong discontinuities, i.e. similar morphologies are not necessarily mapped to neighbouring points in the parameter space. This limits the reliability of these parameters for classification purposes. Two-dimensional Sersic profiles accounting for centroid and ellipticity are identified as the currently most reliable parametrisation scheme in the regime of intermediate signal-to-noise ratios and resolutions, where asymmetries and substructures do not play an important role. We argue that basis functions provide good parametrisation schemes in the regimes of high signal-to-noise ratios and resolutions. Concerning Sersic profiles, we show that scale radii cannot be compared directly for profiles of different Sersic indices. Furthermore, we show that parameter spaces are typically highly nonlinear. This implies that significant caution is required when distance-based classificaton methods are used.Comment: 18 pages, 13 figure

    Density mapping with weak lensing and phase information

    Get PDF
    The available probes of the large scale structure in the Universe have distinct properties: galaxies are a high resolution but biased tracer of mass, while weak lensing avoids such biases but, due to low signal-to-noise ratio, has poor resolution. We investigate reconstructing the projected density field using the complementarity of weak lensing and galaxy positions. We propose a maximum-probability reconstruction of the 2D lensing convergence with a likelihood term for shear data and a prior on the Fourier phases constructed from the galaxy positions. By considering only the phases of the galaxy field, we evade the unknown value of the bias and allow it to be calibrated by lensing on a mode-by-mode basis. By applying this method to a realistic simulated galaxy shear catalogue, we find that a weak prior on phases provides a good quality reconstruction down to scales beyond l=1000, far into the noise domain of the lensing signal alone.Comment: 11 pages, 9 figures, published in MNRA
    corecore